Analysis of Laugh Signals for Automatic Detection and Synthesis
نویسنده
چکیده
Laughter is a nonverbal vocalization that occur often in continuous speech. It is produced by the speech production mechanism using a highly variable physiological process. The vocalized expression of laughter varies across gender, individuals and context. Despite its variability, laughter is perceived naturally by humans. Since laughter is produced by the human speech production mechanism, spectral features were generally used for the study of laughter acoustics. This work mainly aims at showing the significance of excitation information for analysis of laugh signals. We proposed acoustic features which are motivated from the production characteristics of laughter. The features are based on pitch period (T0), the strength of excitation derived from zero-frequency filtered signal, amount of breathiness and loudness measure. It is observed that there will be sudden bursts of air flow through the vocal tract in the case of laughter. This will result in faster vibration of the vocal folds, and hence reduction in the pitch period. Apart from decrease in the pitch period, there is also a raising pattern in the pitch period contour. Similar observations are also made in case of strength of excitation. The strength of excitation rises sharply and then falls almost at the same rate. Since there will be more air flow, laughter is typically accompanied by some amount of breathiness. It will be reflected perceptually as less louder and more noisy. These loudness and non-determenistic component (noise) in the signal are estimated using measures based on Hilbert envelope. A method is proposed using these features for detecting laughter in continuous speech. The method is tested on a noisy data (TV show data) and a clean data (AMI Corpus) and the results are reported. A method is also proposed for synthesizing isolated laughter by modifying the above iv features. The perceptual significance of each of the features is studied using an analysis by synthesis based approach. The affect of each of the features are studied by deemphasizing them from some original laugh signals, and performing perceptual evaluation on the obtained signals. The study indicated that pitch period pattern is the most significant factor, followed by call-intercall duration, breathiness and strength of excitation.
منابع مشابه
Automatic Sleep Stages Detection Based on EEG Signals Using Combination of Classifiers
Sleep stages classification is one of the most important methods for diagnosis in psychiatry and neurology. In this paper, a combination of three kinds of classifiers are proposed which classify the EEG signal into five sleep stages including Awake, N-REM (non-rapid eye movement) stage 1, N-REM stage 2, N-REM stage 3 and 4 (also called Slow Wave Sleep), and REM. Twenty-five all night recordings...
متن کاملK-Complex Detection Based on Synchrosqueezing Transform
K-complex is an underlying pattern in the sleep EEG. Due to the role of sleep studies inneurophysiologic and cognitive disorders diagnosis, reliable methods for analysis and detection of this patternare of great importance. In our previous work, Synchrosqueezing Transform (SST) was proposed for analysisof this pattern. SST is an EMD-like tool, which benefits from wavelet transform and reallocat...
متن کاملEpileptic seizure detection based on The Limited Penetrable visibility graph algorithm and graph properties
Introduction: Epileptic seizure detection is a key step for both researchers and epilepsy specialists for epilepsy assessment due to the non-stationariness and chaos in the electroencephalogram (EEG) signals. Current research is directed toward the development of an efficient method for epilepsy or seizure detection based the limited penetrable visibility graph (LPVG) algorith...
متن کاملDimensionality Reduction and Improving the Performance of Automatic Modulation Classification using Genetic Programming (RESEARCH NOTE)
This paper shows how we can make advantage of using genetic programming in selection of suitable features for automatic modulation recognition. Automatic modulation recognition is one of the essential components of modern receivers. In this regard, selection of suitable features may significantly affect the performance of the process. Simulations were conducted with 5db and 10db SNRs. Test and ...
متن کاملUsing Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media
Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010